Fault-Tolerant Systolic Array Design With Partially- Reconfigurable FPGAs

نویسندگان

  • Nick Iliev
  • Dan Schonfeld
چکیده

Partial dynamic reconfiguration, PDR, is an important feature of modern reconfigurable architectures such as the Xilinx Virtex FPGA devices. In this paper we focus on the application of PDR to fault-tolerant systolic arrays in one dimension. Single as well as multiple faults in one or more array processing elements, PEs, are considered. In our approach modular redundancy is not used and the goal is to maintain total processing latency of the array at the original non-faulty value. This avoids stalling the data input due to reconfigurations of one set of PEs to another set to work around the fault. We consider arrays of N PEs with all possible single-fault locations and formulate an integer linear programming, ILP, model and constraints for each case. The optimal solution selects the PE to reconfigure as a temporary replacement of the faulty PE, so that total processing latency is not increased when the processing clock rate is higher than the data arrival rate. We also develop a heuristic method for solving the ILP problem, which overlaps as many reconfiguration and data processing tasks as possible. The runtime of the heuristic method is very reasonable and matches the exact ILP solution in each case. Experimental results for a single-fault N PE array show that our heuristic finds optimal temporary reconfigurations so that input data does not have to stall.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fault-Tolerant Implementation of Systolic Arrays

In this paper, several fault-tolerant hardware implementations of Systolic Arrays on reconfigurable devices, such as FPGAs, are investigated and primal selection criteria such as resource utilization, computation time, advantages and disadvantages of every fault-tolerant method are presented. Since the main goal of the designs presented herein is the high performance of the implementations, the...

متن کامل

A (Fault-Tolerant)2 Scheduler for Real-Time HW Tasks

This paper describes a fault-tolerant scheduler that uses the AreaTime response Balancing algorithm (ATB) for scheduling real-time hardware tasks onto partially reconfigurable FPGAs. The architecture of the ATB scheduler incorporates fault-tolerance by design features; including Triple Modular Redundancy (TMR), parity protection of its memories and finite state machines, as well as spatial and ...

متن کامل

Evolutionary Dynamic Allocation of Relocatable Modules onto Partially Damaged Xilinx FPGAs

Dynamically and partially reconfigurable FieldProgrammable Gate Arrays (FPGAs) open the doors for new more advanced fault-tolerance techniques. This technology permits indeed to adjust at runtime the architecture of the system in order to keep it fault-free at every time. We present a tailored Genetic Algorithm (GA) aimed at online searching a fault-tolerant architecture for the system in terms...

متن کامل

An Overview of Different TMR Design Techniques with Comparison to Alternative Fault- Tolerant Design Methods for FPGAs

With growing interest in the use of SRAM-based FPGAs in space and other radiation environments, there is a greater need for efficient and effective fault-tolerant design techniques specific to FPGAs. Triple-modular redundancy (TMR) is a common fault mitigation technique for FPGAs and has been successfully demonstrated by several organizations. This technique, however, requires significant hardw...

متن کامل

A Domain Partition Model Approach to the Online Fault Recovery of FPGA-Based Reconfigurable Systems

Field programmable gate arrays (FPGAs) are widely used in reliability-critical systems due to their reconfiguration ability. However, with the shrinking device feature size and increasing die area, nowadays FPGAs can be deeply affected by the errors induced by electromigration and radiation. To improve the reliability of FPGA-based reconfigurable systems, a permanent fault recovery approach usi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011